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Abstract. - For many externally driven complex systems neither the noisy driving force, 
nor the internal dynamics are a priori known. Here we focus on systems for which the time 
dependent activity of a large number of components can be monitored, allowing us to separate 
each signal into a component attributed to the external driving force and one to the internal 
dynamics. We propose a formalism to capture the potential multiscaling in the fluctuations and 
apply it to the high frequency trading records of the New York Stock Exchange. We find that 
on the time scale of minutes the dynamics is governed by internal processes, while on a daily or 
longer scale the external factors dominate. This transition from internal to external dynamics 
induces systematic changes in the scaling exponents, offering direct evidence of non- universality 
in the system. 



While it is hard to find a generally accepted definition of complex systems, most systems 
that are colloquially labeled " complex" include a large number of interacting constituents (or 
nodes) whose collective dynamics leads to emergent spatial and/or temporal structures [1]. 
The most studied examples that fit this paradigm include the cell, vehicular traffic or the 
' World- Wide- Web [2]. Very often these systems operate far from equilibrium and under the 
, influence of an external driving force. Yet, typically the mechanisms governing the internal 
^ • dynamics of these systems are not a priori known and even the driving force is not necessarily 
5^ , under the observer's control. Moreover, the separation of the system from its environment is 
often arbitrary, making difflcult to systematically distinguish the internal from the external 
degrees of freedom. 

(*) Email: eislerSmaxwell.phy.bme.hu 
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With the improvement of the measurement and information processing tools an increasing 
number of systems can be monitored through multichannel measurements, offering the pos- 
sibility to record and characterize the simultaneous time dependent behavior of many of the 
system's constituents. These advances in observational techniques offer an important scientific 
challenge: Can we design systematic methods that, taking advantage of the new datasets, can 
help us to map out the interactions and the dynamics of various complex systems? Consider- 
ing the large number of constituents and the complexity of the behavior displayed by them the 
above task is a truly ambitious undertaking, thus even partial progress is of major potential 
significance. 

Concepts like scaling, multiscaling and universality [3] have been found extremely useful 
in the characterization of complex phenomena, as they offer general relationships, leading to 
organizing and systematic categorization principles. Indeed, recent measurements focusing on 
the fluctuations at the "nodes" of several complex systems [4] indicate that the relationship 
between the standard deviation ai and time average (fi) of the fi{t) signal capturing the time 
dependent activity of node i — 1, . . . ,N follows the scaling law 



Yet, this finding leaves a number of important questions unanswered. For example, the mea- 
surements indicate that real systems belong to one of the two extreme universality classes 
characterized by either awl/2 (observed for the Internet and computer chip) or a w 1 
(highway traffic, river network and World- Wide- Web) , the former describing an endogenous, 
while the latter an exogenous dominance in the fiuctuations. Finding systems that display 
internal organizing principles leading to different exponents from these two extreme values 
would significantly enrich our understanding of complex dynamics. Furthermore, universal- 
ity in statistical physics is more than mere numerical agreement of exponents: In critical 
phenomena [5] whole scaling functions are expected to be universal. A natural question in 
this context can be formulated as follows: Are the distributions of the fluctuations character- 
ized by universal scaling functions? In this sense a signature of non-universality would be if 
some systems showed multifractality while others did not. Indeed, in the systems investigated 
thus far multiscaling appeared to be absent [6]. A major goal of this paper is to introduce 
the computational tools to uncover potential multiscaling in real systems. Finally, we apply 
these tools to the stock market, offering direct evidence of both non- universal exponents and 
multiscaling. 

To capture the dynamics of systems that can be monitored through several channels we 
decompose the signals into two components, that we will call the external and the internal 
part [7]. For this we deflne the system's global activity F{t) as a sum over the activity of all 
elements 



As F{t) characterizes the common trends in the systems's activity, in Q the individual, 
independent fluctuations of the components are averaged out. The components are expected 
to follow this " external" or averaged trend (which itself may be noisy) , while the fluctuations 
around F{t) are of "internal" origin, driven by interactions among the system's components. 
For a wide variety of cases the total activity fi (t) of node i can be split into an external activity 
defined as 



(1) 




(2) 



(3) 
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representing node i's expected share of the global activity F{t), and the deviations or internal 
activity is given by 



fr\t)^m~^-^^F{t), (4) 

where (■) denotes temporal average [8] , while Q the fluctuating component of a node's activity. 
By definition, (/™*) = 0. However, ijSJl contains the expected changes in the node's activity, 
given the overall changes in the system's activity. If the system is closed, then F{t) is time 
independent. Thus there are no changes in the external component fi^^{t) either. 
The standard deviation cr of the activity of the ith component is 




f^''-(fl'')) ), (5) 



where the label {£) represents tot, int or ext, indicating that cr^ is calculated from the total, 
internal or external signal, respectively. We will everywhere omit {£) for the total case. 

In order to investigate the multiscaling behavior of the fluctuations of total noise we propose 
the multiscaling relation: 

(I/. -(/.)!') = C|,(At,g) (6) 

This means that all qth order central moments of activity, which characterize fluctuations 
around the mean behavior, scale as power-laws with the mean total activity of the same 
element. In this notation, a — a{2). Note that analogous definitions can be given for the 
internal and the external case. In particular, as (|/f^*|'') = for external activity we 

have a{q;ext) = 1 [see Q], indicating that multiscaling can be present only in the internal 
or total fluctuations. In ((HJ a{q) is a formal analogue to the generalized dimensions H(q) for 
multifractal or multiaffine time series [10], defined as 

(I/. -(/.)!") = C^(z,g)At«^(^'«), (7) 

where At is the time above which the averages are to be taken. Combining (jSJ and (TJ one 
can eliminate a variable, obtaining: 

iUAtr'-^'''^ CF(.At,q) _ CT{i,q) 



(^j_^^^s^aiAt,q')H{i,q)/H{i,q') CpiAt, q')H(l,q) / H (l^q' ) (J^f^^^ qr^^H(i,q) / H{i,q') ' 

Here the r.h.s. does not depend on At, thus the l.h.s. should not either. Information on 
the temporal scaling goes into the r.h.s. in form of the exponent ratios H{q) / H{q'). However, 
if the time series does not show niultifractality, i.e., H{q) = H{q') then the simple moment 
ratios from ((BJ should not depend on the the averaging time: 



CF{At,q) {U,Atr'^'''' ^ Cril^q) 

Let us turn to an example where the tools introduced above prove useful. In the study of 
the financial market as a complex system statistical physics concepts turned out to be very 
powerful [11]. Enormous amount of data is available as every transaction is recorded on the 
stock market. A natural choice of constituents here are the stocks of firms that are publicly 
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traded. In order to keep the analogy with previously studied cases we have chosen the flow 
as the signals. On a given time horizon Ai, let the (total) activity or flow of the ith stock at 
time t be 

f^\t)= Y.i^)y^{T), (10) 

T(£[t,t + At] 

where r runs for all trades of the ith stock in the given interval. This corresponds to the 
coarse-graining of the individual events, or the so-called tick- by-tick data, ^^(t) is the price 
and Vi{T) is the traded volume for the trade at time r [12]. We used data with a minute 
resolution where the price within the minute was that of the last trading minute (which 
causes a negligible error). Hence, f^^{t) gives the total traded value of stock i between times 
t and t + At. Ai can be chosen as any multiple of 1 minute, and i = 0, At, 2At, .... In 
the following for simplicity of notation we will omit Ai's where they are kept constant, only 
indicating their value once. 

For empirical analysis we used the TAQ database [13] of the New York Stock Exchange 
for the period of 2000-2002, which after some filtering [14] contains TV = 2200 stocks. 

The a{q) scaling exponents were measured for fixed At, examples being shown in Fig. ^ 
The first striking observation is that for the second moment we find power law behavior over 
five orders of magnitude with an exponent which is significantly different from both 1/2 and 
1. Second, we find multiscaling (i.e., a dependence of a{q) on q, except for the trivial case of 
external activity), as shown in Fig. 13 Furthermore, the exponents show a strong dependence 
on the time horizon At. 




Fig. 1 - Scaling of g = 2 central moments of activities and fitted linear trends, (a) At = 10 minutes. 
a(2; tot) ^ a{2; int) = 0.773 ± 0.003. (b) At = 390 minutes = 1 trading day. a(2; tot) ^ a{2; int) = 
0.885 ± 0.005. Errors are estimated from the linear fit on the log-log plot. Note that although the 
errors are very low due to the large number of data points, there is pronounced unexplained variance 
around the expected scahng of total and internal noise. This can be reduced significantly if one takes 
the system structure (in our case the clustering of stocks into market sectors) into account [9]. 

In the activity (|10l) there are two sources of external impact, a random and a regular 
part. The latter is manifest in, e.g., different kinds of seasonalities and intraday patterns. 
Since we are interested in the fluctuations it is natural to detrend the data from such regular 
contributions. In this respect intraday patterns are particularly strong [11]: At the beginning 
and end of the trading day the activity is anomalously high. In addition, one finds a small 
irregularity in activity right after 10 a.m., which is a typical time for news arrival. The 
pattern, averaged for all full-length days in our dataset, is shown in Fig. |21 which is used for 
detrending, achieved by dividing traded values by the respective values of the trend pattern. 

We measured the scaling exponents a, using fits similar to those in Fig. ^ Simultaneously, 
we calculated the values of log(77(At)) (with rj = Oextj^int) averaged over all stocks. Both 
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Fig. 2 - The a(g) multiscaling exponents for all three types of activity. a{q;ext) = 1. (a) At = 10 
minutes, (b) At = 390 minutes = 1 trading day. 



results are shown in Fig. ^ indicating a significant At dependence. One finds that as we 
move from the minute scale to the weekly scale, the nature of fluctuations changes gradually 
toward the externally driven limit. This is in correspondence with results for the 77 ratio (see 
Fig. ^b)). Detrending of the data causes a little systematic decrease in a values for time 
horizons less than one day. 

An anomalous non-monotonicity is present in the raw data [see Fig. El^b)], while for the 
detrended data, we recover the expected tendency: As lAt increases, the external driving force 
gets stronger. The range of ry is similar to that in [4]. We find, that for Ai's larger than a few 
days, the external contribution to fluctuations saturates. 

As we change the time scales (At = 1 minute. . . 2 weeks) we observe a transition between 
two limits. With decreasing At, we get closer to the limiting endogenous behavior a = 
OLint < 1, and 77 <C 1. Yet, Fig. ^a) indicates, that even the hmiting value should 
significantly differ from 1/2 measured in several systems [4,15]. This suggests that despite 
earlier observations [4] the behavior = 1/2 is not universal, although the mechanism 
responsible for this non-trivial scaling is unknown. Furthermore, we observe a continuous 
dependence of the exponent aint on At with scaling spanning over five orders of magnitude, 
indicating that the observed exponents are likely not due to finite size crossover phenomena. 
With increasing At there is a growing role of external forces, and beyond the daily scale we 
reach the exogenous limit, where a — > 1 and 77 0.20. In this limit the dynamics is dominated 
by the external driving force. This mechanism is behind the so called Epps-effect [16], namely 




100 200 300 400 
minute of trading day 



Fig. 3 - The average intraday pattern of trading activity. The activity time series of one example day 
is given for comparison. 
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Fig. 4 - (a) Values of a as a function of the time scale At. There is a clear tendency present. The 
exponent grows from a lower value corresponding to the dominance of internal dynamics to nearly 1, 
which characterizes externally driven systems, (b) The ratio of the standard deviation of fluctuations 
originating from external and internal sources. Raw data shows an anomalous behavior, the result 
of peaks in trading activity at the beginning and the end of each day of trading. Results for data 
detrended with the average of this daily pattern show an effect similar to the one in the behavior 
of the exponent a. In the At = 1 . . . 390 minutes = 1 day interval, there is a faster increase in the 
contribution of external fluctuations. Beyond the scale of a few days this tendency saturates. 
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Fig. 5 - Histogram of the ratios of the internal and external standard deviations for all stocks, both 
for raw and detrended time series. The comparison shows, that external influence is stronger on a 
daily time scale than for fine resolution data, (a) At — 10 minutes. Detrending with the intraday 
pattern removes the spurious external driving force due to daily peaks in activity. This decreases the 
measured external contribution in fluctuations, (b) At — 390 minutes — 1 trading day. Detrending 
does not change behavior measured on a daily (or longer) scale. 



that in high frequency data the cross correlations between stock data are much less pronounced 
than in, say, daily ones: Correlations reflect the similarities in how different stocks react to 
external effects and this is covered for short time horizons by noisy internal dynamics. 

These results offer a coherent qualitative picture about market dynamics. The impact of 
incoming news needs a finite time to diffuse. Hence, on short time scales, the response to 
them is small. The factor that determines the fluctuations of trading activity is internal: it 
is the trading mechanism itself. On daily or longer scales, however, the internal fluctuations 
have smaller importance, and the market tends to move with the global activity. In periods 
of "business as usual" the natural human scale of one day seems to be needed to reach a kind 
of coherence: News and trends can be evaluated, informations are exchanged and collective 
decisions are made. Interestingly, the scaling of asset return distributions [11] also breaks 
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down on the scale of one day, see e.g. [17]. 

Distinguishing between endogenous and exogenous origins of market events is a central 
research problem [18]. Though theoretically and in agent-based model calculations it has been 
possible to investigate this question, its empirical study is extremely difficult. The appealing 
feature of the presented method is that it is based purely on multichannel time series and 
no knowledge of the internal structure or the dynamics goes into it. Thus it can serve as an 
empirical foundation for simulations and further theoretical work. Clearly, its applicability 
goes much beyond the examples discussed so far. 

In summary, we have introduced a multiscaling formalism to study fluctuations in complex 
systems. We find that non-universal behavior is manifested not only in a exponents different 
from the universal values 1/2 and 1 but also in the scaling properties of the distribution 
functions. The a exponents found for the flow data of 2,200 stocks on the NYSE showed 
a continuous dependence on the time horizon with good quality scaling over flve orders of 
magnitude. An additional signature of non-universal behavior is that multiscaling was found, 
in contrast to several other complex systems investigated in the similar fashion [6] . 
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